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Abstract. This paper considers the problem of showing that every pair of 
binary trees with the same number of leaves parses a common word under a 
certain simple grammar. We enumerate the common parse words for several 
infinite families of tree pairs and discuss several ways to reduce the problem 
of finding a parse word for a pair of trees to that for a smaller pair. The 
statement that every pair of trees has a common parse word is equivalent to 
the statement that every planar graph is four-colorable, so the results are a 
step toward a language theoretic proof of the four color theorem. 



1. Introduction 

Let G be the context-free grammar with start symbols 0,1,2 and formation rules 
-> 12, -> 21, 1 -> 02, 1 -> 20, 2 -> 01, 2 -> 10. An ra-leaf tree T parses a 
length-n word w on {0, 1, 2} if T is a valid derivation tree for w under the grammar 
G; that is, there is a labeling of the vertices of T compatible with the formation 
rules such that the leaves of T, from left to right, are labeled with the letters of w. 
For example, the tree 




parses the word 0110212, as can be seen by this labeling: 




Note that labeling the leaves of a tree uniquely determines a labeling of the internal 
vertices under G if a valid labeling exists. 

Certainly G is ambiguous — there exist distinct trees that parse the same word; 
for example, the trees 
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both parse 010. To take a somewhat larger example, the trees 
both parse the word 0110212: 



l 




10 ^ ^2 



2 1 2 110 




However, something much stronger can be said about this grammar. 
Theorem 1. The grammar G is totally ambiguous. 

That is, every pair of derivation trees with the same number of leaves has at least 
one word that they both parse. Kauffman [3] proved this theorem (in a slightly 
different form, as we describe below) by showing that it is equivalent to the four 
color theorem — the statement that every planar graph is four-colorable. The 
four color theorem was proved by Appel, Haken, and Koch [1] [2] using substantial 
computing resources. The hope of the present authors is that a direct proof of 
Theorem [T] will be shorter than the known proofs of the four color theorem, thereby 
providing a shorter proof of the four color theorem. 

In this paper we describe first results in this direction. First we show that 
Theorem [l] is equivalent to Kauffman's formulation. Section [3] determines explicit 
common parse words for several simple parameterized families of tree pairs. In 
Section [4] we establish existence of parse words for more general families. In Sec- 
tion [5] we enumerate the common parse words of a 3-parameter family of tree pairs. 
We conclude in Section [6] by discussing in more generality methods of reducing the 
problem of finding a common parse word for a pair of trees. 

A Mathematica package [5] and a Maple package [6j that accompany this paper 
and facilitate the discovery of the results we present can be downloaded from the 
respective web sites of the second and third authors. 

2. Relationship to the cross product 

The set of possible derivation trees under G is the set of binary trees — trees in 
which each vertex has either or 2 children. (All trees in the paper are rooted and 
ordered.) 

Let \w\ be the length of the word w, and let \w\i be the number of occurrences 
of the letter i in w. 

Proposition 2. Let w be a word of length n on {0, 1,2} and T an n-leaf binary 
tree that parses w. Then for some permutation (r, s, i) of (0, 1, 2) , 

\w\ s = \w\ t ^ \w\ r = \w\ mod 2. 

Moreover, the root of T receives the label r when parsing w. 
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Proof. The congruence holds for the three words of length 1, and the derivation rules 
of G preserve it because all four terms change parity with each rule application. □ 

It follows that if the parities of \w\q, \w\ 1, and \1v\2 are equal then no tree parses 
w under G. If on the other hand the parity of \w\ r differs from the other two, then 
r is an invariant of w in the sense that any tree parsing w has its root labeled r. 

Kauffman 4 formulated Theorem [I] not in terms of a grammar but in terms 
of the cross product on the standard unit vectors k in R 3 . The cross product 
satisfies 

1 x 3 = (-«) x (-j) = (-j) x i = j x (-i) = k, 
3 x k = (-j) x (-k) = (-k) x]=kx (-j) = i, 
k x 1 = (— k) x (— 1) = (— i) x k — i x (— k) = 3. 

Further, for every vector v £ M 3 we have 

v x v = v x (— v) = 0, 
ux0 = 0xu = 0. 

The cross product on K 3 is not associative, so in general the expression v\ x v 2 x 
■•• x v n is ambiguous; to evaluate it for a given tuple (vi, V2, ■ ■ . , V n ) we must 
choose an order in which to compute the n — 1 cross products. Let us call such 
an order an n-bracketing. Kauffman showed that the four color theorem is equiv- 
alent to the statement that for every pair of n-bracketings there exists an n-tuple 
(ui, t>2, ■ • ■ , v n ) £ {£, j, k} n such that the two bracketings of v\ x u 2 x ■ ■ ■ x v n evaluate 
to the same nonzero vector. 

We now develop tools to show that Theorem[T]is equivalent to Kauffman's state- 
ment. Roughly speaking, we show that we can replace ±i — > 0, ±j — > 1, and 
±k — > 2. It is easy to see that n-bracketings are in bijection with n-leaf binary 
trees. Therefore, given an n-bracketing of V\ x v-i x • • • x v n , one may label each 
internal vertex of the corresponding binary tree with the cross product of the labels 
of its children (in order) . The condition that the bracketing does not evaluate to 
is equivalent to the condition that the evaluation does not encounter the product 
v x v or v x (— v), hence our formation rules for the grammar G. 

Each («x, 1)2, ■ ■ ■ , v n ) £ k} n possesses an invariant analogous to that of 
Proposition [2j To see what this invariant is, consider the quaternion group, whose 
elements are Q = {1, k, — 1, —i, —j, —k} and whose binary operation • satisfies 

i- 3 = (-«) ' (-j) = (-j) ■i=j- (-«) = k, 
3-k = (-j)-(-k) = (-k)-j = k-(-j)=i, 
k ■ i = (— k) ■ (—i) = (— i) ■ k = i ■ (— k) = j, 

as well as identities such as (—1) • (—1) = 1 and (—1) • i = — i suggested by the 
notation. Further, for q £ k, —i, — j, —k} we have q ■ q = —1 and q ■ (—q) = 1. 
Consider 4> ■ {hj, k, —i, — j, —k} — > {i,j, k, —i, — j, —k} mapping 

<j)(i) = i, <j>(-i) = -i, 

4>{3) = j, <t>{-3) = -3, 

<f>(k) = k, <l>(-k) = -k. 
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The map is a "partial homomorphism" in the sense that <j){v\ xv 2 ) = <p{vi)-^)(v2) if 
V\ ^ v 2 and Vi 7^ —v 2 . This property allows us to establish the following invariant. 

Proposition 3. Let (yi, V2, ■ ■ • ,v n ) £ {i, j, k} n , and choose a bracketing of vi x 
i>2 x • • • x v n that does not evaluate to the zero vector. Then this bracketing evaluates 
to • (j){v 2 ) ■ ■ ■ </>(««))■ 

Proof. Since the bracketing of v\ x v 2 x • • • x v n does not evaluate to 0, each 
of the n — 1 cross products is an operation on two linearly independent vectors. 
Therefore we may emulate the evaluation of the bracketing in Q rather than in R 3 , 
because replacing x with • is consistent with (j). Since • is associative, the bracketing 
evaluates to 4>(vi) ■ 4>(v 2 ) ■ ■ ■ 4>(v n ) in Q. Moreover, since we do not encounter in 
R 3 , we do not encounter —1 or 1 in Q; in particular, 4>{v\) ■ 4>(V2) • • • <j>[vn) £ 
k, —i, ~j, — fc}, and therefore <\r x (4>(vi) ■ 4>{v 2 ) ■ ■ • 4>(v n )) exists. □ 

A result of Proposition [3] is that we can drop the condition in Kauffman's state- 
ment that the two bracketings evaluate to the same vector. Therefore the four 
color theorem is equivalent to the statement that for every pair of n-bracketings 
there exists an n-tuple (v\,V2, ■ ■ ■ ,v n ) € k} n such that the two bracketings of 
v\ x v 2 x • • • x v n evaluate to nonzero vectors. 

Consider the homomorphism a : Q — > Q/{1, —1} = V, where V = {e, 0, 1, 2} is 
the Klein four-group, e = {1, —1} is the identity element, = {i, —i}, 1 = {j, —j}, 
and 2 = {fc,— k}. Let t : {i,j,k} — > {0,1,2} be defined by t(v) — a((j>(v)). In 
other words, r removes the hat and forgets the sign. Since a is a homomorphism, 
evaluating T\ and T 2 at {v\,v 2 , . . . ,v n ) € k} n results in nonzero vectors if and 
only Ht(vi)t(v 2 ) ■ ■ ■ r(v n ) is a parse word for 7\ and T 2 . Therefore, for n-leaf binary 
trees T± and T 2 , the n-tuples {v\,v 2 , . . . ,v n ) £ {i,j,k} n that evaluate to nonzero 
vectors when bracketed by T\ and T 2 are in bijection with words w S {0, 1, 2} n that 
are parsed by both T\ and T 2 . It follows that Theorem [T] is equivalent to the four 
color theorem. 

3. Parameterized families 

In this section we introduce several families of binary trees and enumerate the 
parse words of several pairs of these trees. First we establish some additional 
terminology. 

If T parses a word w on {0, 1,2}, then T also parses all words obtained from 
w by permuting the letters in the alphabet. Let Parse Words(Ti, T 2 ) be the set of 
equivalence classes (under permutations) of words parsed by both trees T\ and T 2 . 
We abuse notation slightly by writing a representative of each equivalence class. 
For example, it turns out that for the pair of 7-leaf trees mentioned in Section [T] 
there is only one equivalence class of parse words, so for those trees we write 

Parse Words(Ti, T 2 ) = {0110212}. 

Often we will take this representative to be the word in the equivalence class which 
is lexicographically first — words of the form or fe li>. However, we will de- 
part from this convention when convenient. The four color theorem is equivalent 
to the statement that for every pair of n-leaf binary trees T\ and T 2 we have 
ParseWords(Ti,T 2 ) ^ {}. 

The level of a vertex is its distance from the root. That is, the root lies on level 
0, the root's children lie on level 1, and so on. 
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A path tree is a binary tree with at most two vertices in each level. The 5-leaf 
path trees are as follows. 



The two leaves on level n — 1 in an n-leaf path tree are called the bottom leaves. 

The set of n-leaf path trees is in trivial bijection to the set {l,r} n ~ 2 of (n — 2)- 
length words on {I, r}: Since each level has at most two vertices, at most one vertex 
in each level has children, so we may form a word that records which child — left 
or right — has children at each level. We shall use this bijection to define several 
families of trees. 

Because of their linear structure, path trees are simpler to work with than binary 
trees in general, so the emphasis of this paper is on path trees. Indeed, several 
infinite families of pairs of path trees can be shown to satisfy Theorem [I] directly 
and have only a few parse words. We take up this task now. Some of the proofs 
work by finding out where the local conditions imposed by the two trees force a 
unique labeling and then just working out the consequences, so in some cases it 
may be quicker to prove the theorem for yourself than to read the proof provided. 

Let LeftCombTree(n) be the n-leaf path tree corresponding to the word l n ~ 2 . 
The left comb trees for n > 2 are pictured below. 



Let RightCombTree(n) be the n-leaf path tree corresponding to r™ -2 ; RightComb- 
Tree(n) is the left-right reflection of LeftCombTree(n). We warm up with some 
combinatories. 

Theorem 4. ParseWords(LcftCombTree(n), RightCombTrcc(n)) = 



Proof. We build a common parse word from left to right — up LeftCombTree(n) 
and down RightCombTree(n). At every leaf, each tree will eliminate one possible 
label, so the parse word will turn out to exist and be unique. 

The case n = 2 can be established by testing all words of length 2, so let n > 3. 
Without loss of generality we may label the first two leaves and 1. It follows 
from this that the root of RightCombTree(n) receives the label 1, the non-leaf 
(internal) vertex on the second level of RightCombTree(n) receives 2, and therefore 
the internal vertex on the third level of RightCombTree(n) receives 0. This implies 
(from the right comb) that the third leaf cannot receive 0. However, from the left 
comb we find that the third leaf cannot receive 2. Therefore the third leaf receives 
1. For the fourth leaf, the right comb precludes 2 and the left comb precludes 0, so 
the fourth leaf receives 1. Likewise all the way down the word through leaf n — 1. 
The internal vertex labels in each tree alternate between and 2, except for the 
root which receives 1. If n is odd then the lowest internal vertex in the right comb 





{01"- 2 2} if n> 2 is even 
{01 n - 2 0} ifn>3isodd. 



6 



Cooper, Rowland, and Zcilbcrgcr 



receives 2, so that the last leaf receives 0; if n is even then this internal vertex 
receives 0, and the last leaf receives 2. □ 

Note from the proof of this theorem that the internal labels corresponding to a 
common parse word of LeftCombTree(n) and RightCombTree(n) will match (top 
to bottom) if n is odd, and will differ by the permutation which swaps and 2 if n 
is even. 

Let LeftTurnTree(r7i, n) be the (m + n)-leaf path tree corresponding to l m r n ~ 2 , 
and let Right TurnTree(m, n) be the tree corresponding to r m l n ~ 2 . Each of these 
trees is formed by "gluing" together two comb trees. For example, 



LeftTurnTree(2, 3) = 




The following theorem is a special case of the general treatment of two turn trees 
given in Section [5] 

Theorem 5. For m > 1, 

ParseWords(LeftTurnTree(m, n), Right TurnTree(l, m + n — 1)) = 

{001"- 3 20 m ,021"- 3 00 m } ifn>3is odd 
{021"- 3 20 m ,001"- 3 00 m } ifn>4is even. 

Proof. Without loss of generality, label the last leaf of each tree 0. The roots of 
the trees receive the same label, and thus the respective parents of the last leaf of 
each tree must receive 1 and 2 in some order, and the first leaf must be labeled 0. 
This implies that the last m leaves are labeled 0. There are (up to permutation 
of 1 and 2) three possible options for the labels of leaves n — 1 and n (the bottom 
leaves of LeftTurnTree(m, n)), namely 12, 10, and 01. Each of the first two options 
can be seen to yield a unique common parse word as given in the statement of the 
theorem. The third option, in which leaves n — 1 and n are labeled 01, is not valid, 
since then the sibling of leaf n + 1 in Right TurnTree(l, m+n — 1) is labeled 0, which 
contradicts leaf n + 1 receiving 0. □ 

If w = W\W2 ■ ■ ■ w m is a word of length m and a; is a rational number whose 
denominator (in lowest terms) divides m, let 

w x = w^ xi wiw 2 ■ ■ -t<W(x-|xJ) 

be the word consisting of repeated copies of w truncated at mx letters. For example, 
(lr)V 2 = Mrlrl. 

Let LeftCrookedTree(n) be the path tree corresponding to {lr)^ n ^ 2 ^ 2 . The left 
crooked trees for n > 2 are as follows. 




Let RightCrookedTree(n) be the path tree corresponding to (rl)^ n ~ 2 ^ 2 — the left- 
right reflection of LeftCrookedTree(n). 

The next two results determine the common parse words of a comb tree and the 
completely crooked trees of the same size. Let w R be the left-right reversal of the 
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word w. Let mod(rt, 3) be the smallest nonnegative integer congruent to n modulo 
3. 

Theorem 6. Parse Words(LcftCombTree(n), RightCrookedTree(n)) = 

j'mod(l-n,3) ((012)"/ 6 ) fl (012)(™- 2 )/ 6 | if n > 2 is even 

[mod(l-7i,3) ((012)("- 3 )/ 6 )' R (012)( 7l+1 )/ 6 } ifn>3 is odd. 

Proof. One checks that for n = 2 the set of equivalence classes of parse words is 
{20}. 

Inductively, assume that LeftCombTree(n — 1) and RightCrookedTree (n — 1) 
parse the word claimed and that this is the only word they both parse (up to 
permutations of the alphabet). For even n — 1, the two bottom leaves of Right- 
CrookedTree (n — 1) are leaves and 22 y^. For odd n — 1, they are leaves § and 
Observe that for even n—1 the right bottom leaf of RightCrookedTree(n.— 1) 
receives 0, and for odd n — 1 the left bottom leaf of RightCrookedTree(n— 1) receives 
0. For n — 1 > 4 these are respectively the first and second of the two consecutive 
0s in the parse word. 

We attach A at the bottom of RightCrookedTree (n — 1) to form RightCrooked- 
Tree^) and insert A at the corresponding place in LeftCombTrcc(rt — 1) to form 
LeftCombTree(?i). Label the new bottom leaves of RightCrookedTree(n) 12 if n — 1 
is even and 21 if n — 1 is odd; we can label the corresponding leaves of LeftComb- 
Tree(n) the same by labeling their respective neighboring internal vertices and 1 
if n — 1 is even and and 2 if n — 1 is odd. The permutation 0—^2,1—^0,2—^1 
puts the new word in the form given in the theorem. 

This process is reversible, so every parse word for n comes from a parse word for 

71-1. □ 

The next theorem follows immediately from the previous theorem by labeling 
LeftCombTree(n) and RightCrookedTree(n) with a common parse word and then 
attaching the root of each tree as the left leaf of A. 

Theorem 7. ParseWords(LeftCombTree(n), LeftCrookedTree(n)) = 

|mod(2 - n, 3) ((OU)^-^ 6 ) 11 (012)("- 3 )/ 6 mod(2 - n, 3), 

o , ifn>3is odd 

mod(2 - n, 3) ((O^)^" 1 )/ 6 ) (012)("- 3 )/ 6 mod(-n, 3) j 

|mod(2 - 71, 3) ((012)("- 4 )/ 6 ) fl (012)™/ 6 mod(2 - n, 3), 

ff if n > 4 is even. 

mod(2-n,3) ((012)(™- 4 '/ 6 ) (012)"/ 6 mod(-n, 3) | 

Theorem 8. For n > 2, 

|ParseWords(LeftCrookedTree(n), RightCrookedTree(n))| = 2 L " /2J " 1 . 

Proof. The cases n = 2 and n = 3 are easily verified. In particular, every parse 
word of the 3-leaf pair consisting of 

is of the form aba for a ^ b, where the two roots also get labeled b. 

Let n be odd. Consider inductively extending LeftCrookedTree(n — 2) and Right- 
CrookedTree^ — 2) by 
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respectively to obtain LeftCrookedTree(n) and RightCrookedTree(n). Because the 
three new leaves are leaves (n — l)/2, (n + l)/2, and (n + 3)/2 in both LeftCrooked- 
Tree(n) and RightCrookedTree(n), every parse word 

WiW 2 ■ ■ ■ W(„-3)/2^(n+l)/2 ' ' ' W n _ 3 W n _ 2 

for the two (n — 2)-leaf crooked trees can be extended to a parse word 
wiw 2 ■ ■ ■ W( n _ 3 )/2a&<m>( n+ i)/2 • • • w n _ 3 w n -2 

for the two n-leaf crooked trees. Moreover, every parse word for the two n-leaf 
crooked trees can be obtained in this way. Since there are two choices for a, there are 
twice as many parse words for the n-leaf crooked trees as for the (n — 2)-leaf crooked 
trees, which establishes the statement for odd n; we see that w = w\w 2 ■ ■ ■ W n -\w n 
is a parse word for LeftCrookedTree(n) and RightCrookedTree(n) if and only if 
Wi = w n+ x-i ^ W(„+i)/ 2 for 1 < i < 

Let n be even. Then every parse word of the n-leaf crooked trees can be obtained 
by extending a parse word w\w 2 • ■ ■ w n / 2 -ibw n / 2+ i • • ■ w n -2Wn-i of the (n — l)-leaf 
crooked trees. Every parse word of A in which the root receives label b is of the 
form etc, where a =/= b and c b, so there are twice as many parse words for the 
n-leaf crooked trees as for the (n — l)-leaf crooked trees, which establishes the 
statement for even n. Specifically, w — wiw 2 • • ■ w n -iw n is a parse word for Left- 
CrookedTree(n) and RightCrookedTree(n) if and only if w n / 2 ^ w n / 2+ i and for 
some b € {0, 1, 2} we have Wi — w n +i-i ^ b for 1 < i < ^ — 1. □ 

4. General families 

Presumably explicit parse words can be found for various other parameterized 
families of tree pairs, but we now take a more general approach and establish 
results for tree pairs in which at least one of the trees does not come from a simple 
parameterized family. Some of these results will be used in Section [3J Note that, 
where stated, these results apply to not just path trees but binary trees in general. 

Proposition 9. Let n > 3. If the ith leaf is a bottom leaf in two n-leaf path trees, 
then the trees both parse the word o fc_1 10 Tl_fc for some 2 < k < n— 1. 

For example, this proposition applies to the pair in Theorem [8] consisting of 
LeftCrookedTree(n) and RightCrookedTree(n). 

Proof. If i = 1 then the second leaf is also a bottom leaf in both trees, so let k = 2; 
similarly, if i = n, let k = n — 1. If 2 < i < n — 1, let k = i. Labeling the fcth leaf 1 
and all other leaves produces a valid labeling of both trees because the internal 
vertices of the two trees on each level receive the same label, namely alternating 
between 2 and 1. □ 

We now give two propositions regarding extending a pair of binary trees by .*.. 

Proposition 10. Suppose T[ and T 2 are n-leaf binary trees with a common parse 
word. Extend each tree by attaching A to leaf i, obtaining T\ and T 2 respectively. 
Then 

|ParseWords(Ti,T 2 )| = 2|ParseWords(T{, T^)\. 
In particular, T\ and T 2 have a common parse word. 
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Proof. Let w be a parse word of T[ and T 2 . Without loss of generality we may 
assume that Wi = 0. Replacing u>i by 12 or 21 produces a word that both T\ and 
T 2 parse, and every parse word for the pair arises uniquely in this way. □ 

In the next proposition we consider extending a tree T by inserting A into the 
tree at an internal vertex to "duplicate" a leaf. Fix i, and let S be the tree hanging 
from the sibling vertex of leaf i. Remove S from its position, attach A to the sibling 
of leaf i, and then reattach S to a leaf of the new A as follows. If leaf i is a left leaf, 
attach S to the right leaf of the new A; if leaf i is a right leaf, attach S to the left 
leaf. Therefore if leaf i in T is a left leaf, then leaves i and i + 1 in the extended 
tree are both left leaves, and if leaf i in T is a right leaf, then leaves i and i + 1 in 
the extended tree are right leaves. We refer to this operation as duplicating leaf i. 

Proposition 11. Suppose T{ and T 2 are n-leaf binary trees with a common parse 
word. Extend T[ by attaching A to leaf i, obtaining T\. Extend T 2 to obtain T 2 by 
duplicating leaf i. Then 

I Parse Words (Ti , T 2 ) | = | ParseWords (T{ , T 2 )\. 

In particular, T\ and T 2 have a common parse word. 

Proof. Let id be a parse word of T[ and T 2 . Without loss of generality we may 
assume that u>i — and that the parent of leaf i in T 2 receives the label 1. 

If leaf i is a left leaf in T 2 , then T 2 parses the word obtained by replacing Wi by 
21 since duplicating leaf i in T 2 has the effect of the replacement 



at the parent of leaf i, which preserves the labels of all other vertices. If leaf i is a 
right leaf in T 2 , then T 2 parses the word obtained by replacing u>i by 12 since now 
the replacement is 

1 

/\ — > 02. 
s /\ 
s 1 

Clearly T\ parses both of these words, so we have found a parse word for the 
pair. Moreover, every parse word of T± and T 2 arises uniquely in this way. □ 

In Section [3] we referred to the two leaves of maximal depth in a path tree as 
bottom leaves. In a general binary tree, a bottom leaf is a leaf whose sibling is also 
a leaf. It is clear that for n > 2 every n-leaf binary tree has at least one pair of 
bottom leaves, and every binary tree that is not a path tree has at least two pairs 
of bottom leaves. We use these facts in the next two theorems. 

Theorem 12. Let n > 2, and let T be an n-leaf binary tree. Let I be the level of 
leaf I in T. Then | ParseWords (T, LeftCombTree(n))| = 2 1 ' 1 . 

By symmetry, the analogous result holds for the right comb. 



Proof. We work by induction on n. The only 2-leaf binary tree is LeftComb- 
Tree(2) = A, which has only one parse word up to permutation of the alphabet. 
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Let T be an n-leaf binary tree. Then T has a pair of bottom leaves; suppose 
these are leaves i and i + 1. Remove these two leaves to obtain T", which has 
n — 1 leaves. If i = 1, then Proposition |10| gives twice as many parse words for T 
and Left Comb Tree (n) as parse words for T' and Left Comb Tree (n — 1). If i > 1, 



then leaf i is a right leaf in LeftCombTree(n — 1), so Proposition 11 gives the same 



number of parse words as for T' and LeftCombTree(n — 1). □ 

Csar, Sengupta, and Suksompong [3] have recently provided a generalization of 
Theorem |12| They consider a partial ordering on the set of n-leaf binary trees 
arising from the rotation operation. They show that if Ti and T 2 are n-leaf binary 
trees whose join exists under this partial ordering, then |ParseWords(Ti, T 2 )| is a 
certain power of 2. 

The following theorem is an extension of Theorem [P2| to turn trees, although we 
lose the enumeration. 

Theorem 13. Let n > 4. Let T\ be an n-leaf binary tree and T 2 an n-leaf left turn 
tree. Then Ti and T 2 have a common parse word. 

Proof. We work by induction on n. For n — 4 the result can be verified explicitly. 

Now suppose that every (n — l)-leaf binary tree has a common parse word with 
every (n — l)-leaf left turn tree. Let Ti be an n-leaf binary tree, and let T 2 be an 
n-leaf left turn tree. Then Ti has a pair of bottom leaves; suppose these are leaves 
i and i + 1. 

First we consider the case where the ith leaf of T 2 is the right bottom leaf. If 
T\ is a path tree, then T\ and T 2 have a common parse word by Proposition [9] If 
T\ is not a path tree, then there is another pair of bottom leaves in Ti, so we may 
re-choose i if necessary so that the ith leaf of T 2 is not the right bottom leaf. 

Therefore we may assume that the ith leaf of T 2 is not the right bottom leaf. 
Remove leaves i and i + 1 from Ti to obtain T{ , which has n — 1 leaves and so has 
a common parse word with every (n — l)-leaf left turn tree. 

If the zth leaf of T 2 is the left bottom leaf, then we can apply Proposition [10] to 
obtain a common parse word for Ti and T 2 . Otherwise, leaves i and i + 1 occur on 



consecutive levels in T 2 , so Proposition 11 applies. □ 



5. A PAIR OF TURN TREES 

In this section we give three theorems that collectively determine the number of 
parse words of LeftTurnTree(m, n) and RightTurnTree(fc, m + n— k). Note that by 
Theorem [13] the number of parse words is nonzero. 

Theorem 14. For m > 1, k > 1, and max(2, k — rn + 2) < n < k, 

|ParseWords(LeftTurnTree(m, n), Right TurnTree(fc, m + n — k))\ = 1. 

Proof. The bottom leaves of LeftTurnTree(m, n) (which are leaves n — 1 and n) 
correspond to leaves which are on consecutive levels in Right TurnTree(fc, m + n — k), 
so we can apply Proposition [TT] to see that 

|ParseWords(LeftTurnTree(m, n), Right TurnTree(fc, m + n — k))\ 

= |ParseWords(LeftTurnTree(ni, n — 1), RightTurnTree(fc — l,m + n — k))\. 

Now, our hypothesis applies to this new, smaller tree pair, so we may continue 
reducing in the same way until we have reduced the right comb in the left turn tree 
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entirely away. At this point, we are considering the trees LeftTurnTrcc(TO, 2) = 
LeftCombTree(m + 2) and Right TurnTree(fc — (n — 2), m + n — k), which have a 
unique parse word class by Theorem |12| □ 



Let 



o(m, k) = |ParseWords(LeftTurnTree(m, k + 1), Right TurnTree(fc, m + 1))|. 



By considering the left-right reflections of these two trees, we see that a(m, k) — 
a(k,m). Theorem 15 determines the number of parse words of LcftTurnTree(m, n) 
and Right TurnTree(fc, m+n—k) for n > k + 2 in terms of a(m, k), and Theorem |17| 
evaluates a(m,k). 



Theorem 15. For m > 1, k > 1, and n > k + 2, 



|ParseWords(LeftTurnTree(m, n), Right TurnTree(fc, m + n — k))\ = 2a(m, k). 

Proof. If n > k + 2, then the bottom leaves of LeftTurnTree(m, n) correspond to 
leaves which are on consecutive levels in Right TurnTrcc(fc, m + n — k), so we can 
apply Proposition [TT] to see that 

|ParseWords(LeftTurnTree(m, n), Right TurnTree(fc, m + n — k))\ 

= |ParseWords(LeftTurnTree(m, n — 1), Right TurnTree(fc, m + n — fc — 1))|. 



If our hypothesis applies to this new, smaller tree pair, we may continue reducing 
in exactly the same way until we reach LeftTurnTree(m, k + 2) and Right Turn- 
Tree(fc,m + 2). Leaves k and k + 1 are bottom leaves in both these trees, so by 
Proposition [l0| we have 



|ParseWords(LeftTurnTree(m, n), Right TurnTree(fc, m + n — fc))| 

= 2|ParseWords(LeftTurnTree(m, k + 1), Right TurnTree(fc, m + 1))|. □ 



For the final result concerning the number of parse words of two turn trees, it 
turns out to be convenient to focus on the labels of the internal vertices rather than 
of the leaves. We form a word consisting of the internal vertex labels of a labeled 
path tree by reading these labels from top to bottom. 

A word on {0, 1,2} is alternating if no two consecutive letters are equal. If the 
internal vertices of a path tree are labeled with w, then the labeling can be extended 
to a parse word for the tree precisely when w is alternating. Therefore it will be 
important to know the sizes of certain sets of alternating words. Let A m be the set 
of length-m alternating words of the form 0«2 • • • v m , where V2, v m € {1,2}. Let B m 
be the set of length-m alternating words of the form 0^2 • • • v m , where v 2 £ {1,2} 
and v m £ {0, 2}. 

Proposition 16. Form > 2, \A m \ = (2 m +2(-l) m )/3 and \B m \ = (2 ro -(-l) m )/3. 
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Proof. Let aj(m) be the number of length-m alternating words on {0, 1,2} begin- 
ning with 01 and ending with mod(i, 3). Then 

a,i(m) = a,+i(m - 1) + a i+2 (m - 1) 

= a i+2 (m - 2) + 2a i+3 (m - 2) + a l+4 (m - 2) 

= a i+3 (m - 3) + 3a i+4 (m - 3) + 3a i+5 (m - 3) + a i+6 (m - 3) 



Yl {-jai+n+jim-n) 



j=0 



m-2 , _ ^\ 

X! ( • ) a i+m-2+j(2) 
j=0 V J ' 

'to - 2 



= S v , 

j — — (i+m) mod 3 

since a (2) = a 2 (2) = and di(2) = 1. Therefore 

«oM= E ( m 7 2 ) = | (2™- 2 + (-ir- 1 ) ■ 

j= — m mod 3 

Since ai(m) and a 2 (m) also count alternating words of the forms 02- --2 and 
02 • • • 1 respectively, we have 

\A m \ = 2oi(m) + 2a 2 (m) = 2 (2 m ~ 2 - a (m)) = | (2 m + 2(-l) m ) . 
Similarly, 

\B m \ = 2a a (m) + oi(m) + a 2 (m) - a (m) + 2 m - 2 - 1 (2 m - (-l) m ) . □ 

Next we provide a simple recurrence satisfied by a(m,k). Unfortunately, we do 
not know a correspondingly simple proof. 

Theorem 17. For m > 1 and k > 1, 

a(m + 3, k) - 2a(m + 2, k) - a(m + 1, k) + 2a(m, k) = 0. 

Initial conditions that suffice to completely determine a(m, k) from this recur- 
rence are o(l, 1) = 1, a(l,2) = 1, a(l,3) = 1, a(2,2) = 4, a(2, 3) = 5, and 
a(3, 3) = 3. The particular solution can be written as the matrix product 

/ 2/3 -2 m \ T /1/2 1 1 \ / 2/3 -2* 
a(m,k) = -\ 1 11-1 1 



4 



v 5/3-(-l)"7 \1 -1 1/5/ \5/3 • (-l) fc / 



Toward a language theoretic proof of the four color theorem 



13 



Proof. Let m > 2 and k > 2, and let A m and _B m be as above. Let 

i m (w) = {((oir /2 wMm m/2 )} , 
2 m (w) = {mr /2 ^M^) m/2 )} , 

A m {w) = {(vw, wv) : v e A m } , 
B ro ((01) fe/2 ) = {(«(10) fc / 2 , (01)^) :«£B m }. 

We consider the set of pairs (L,R) of length- (m + fc) (alternating) words such 
that -R = 01 • • • and such that respectively labeling the internal vertices of Left- 
TurnTree(m, k + 1) and Right TurnTree(/c, m + 1) with L and i? produces a parse 
word for the pair. Such pairs (L, R) are in bijection with equivalence classes of 
parse words for this tree pair as follows. The internal vertex labels of a path 
tree determine the labels of all leaves except the bottom leaves. Since Left Turn- 
Tree^, fc + 1) and Right TurnTree(fc, m + 1) do not share both bottom leaves, 
labeling the internal vertices with the pair (X, R) determines a unique parse word. 
We may choose representative parse words so that the internal vertex labels of 
Right TurnTree(fe, m + 1) begin with 01 (since the first two labels cannot be the 
same). 

Let w = 01 • • • be the length-A: prefix of R. Thus the internal vertices of the 
right comb of Right TurnTree ( k, m + 1) are labeled with letters from w, and the 
first letter of the parse word is 2. We show that if w contains all three letters then 
the set of pairs (L, R) is 

{} if w ends in and m is odd 

l m {w) U 2 m (w) if w ends in and m is even 

A m (w) if w ends in 1 or 2 and m is odd 

A m (w) U 2 m (w) if w ends in 1 and m is even 

A m (w) U l m {w) if w ends in 2 and m is even, 

and if w = (01) fe / 2 = 0101 • • • contains only two letters then this set is 

' {((01)(™+fe)/ 2 , (oi)(™+fe)/2)} if w cn ds in and m is odd 

l m (w) U 2 m (w) if w ends in and m is even 

A m {w) U B m (w) if w ends in 1 and m is odd 

A m (w) U B m (w) U 2 m (w) if w ends in 1 and m is even. 

To see this, first suppose that L = vw and R = wv for some v. Then v begins 
with 0, so w does not end in 0, and every v <E A m produces a parse word. 

Next suppose that L = v'w and R = wv for some v' ^ v. Then in fact v and v' 
differ in every position; in particular, v begins with some letter j ^ 0. Let i e {1, 2} 
such that i ^ j. Then the final leaf receives the label i since it is a child of a leaf 
in LeftTurnTrcc(m, k + 1) and a child of a j leaf in Right TurnTrcc(fc, m + 1). It 
follows that v — (jO)" 1 / 2 and v' = (0j)" 1 / 2 ; therefore m is even, and choosing j to 
be either 1 or 2 produces a parse word as long as it differs from the last letter of w. 

Finally, suppose that L = vw' for some length- A: word w' ^ w. Then w and 
w' differ in every position; in particular, w' begins with 1, and it follows that 
w = (01) fe / 2 and w' = (10) fe / 2 . If w ends in 0, then L = R= (Ql)( m + fe )/ 2 , yielding 



< 
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the parse word 2 k 02 m . If w ends in 1, then every v e B m produces a parse word, 
and R = wv. 

Since we know the sizes of all these sets by Proposition [16J we can enumerate 
the set of internal word pairs (L, R) and obtain an expression for a(m, k), which as 
expected is symmetric in m and k. For fixed k the expression is a linear combination 
of 2 m , 1, and (—1)™, so it satisfies the recurrence stated in the theorem, which can 
be written 

(M - 2)(M - 1)(M + 1) a(m, k) = 0, 

where M is the forward shift operator in the variable m. 

When m = 1 or k = 1 one of the two trees is a comb tree, and by Theorem [T2| we 
have a(m, k) = 1, which one checks is also what the general expression for a(m, k) 
gives upon setting m = 1 or k = 1. □ 



6. Reducing a pair of trees 

How might one proceed from the theorems of the previous sections to a proof 
that every two n-leaf path trees parse a common word? Here we introduce two 
notions of reducibility — ways to reduce the problem of finding a parse word for 
a pair of trees to finding parse words for smaller pairs — and give some related 
conjectures. 



6.1. Decomposable pairs. Recall that if T\ and T 2 are n-leaf trees such that 



leaves i and i + 1 are siblings in both trees, then Proposition 10 reduces the problem 
of finding a parse word for T\ and T2 to the problem of finding a parse word for 
the pair of (n — l)-leaf trees in which the common A has been removed. Our 
first observation is that there is nothing special about A; if the two trees have any 
common branch system in the same position, then we can decompose the trees. For 
example, the 8-leaf trees 



Ti = . T> 

share the branch system 





in the second through fifth leaves, which we may remove to obtain the 5-leaf trees 



Given a common parse word W1W2W3W4W5 of this pair of 5-leaf trees, we can find a 
common parse word of the original pair of 8-leaf trees by taking any valid labeling 
of S and permuting the alphabet so that the root receives the label u>2- 

In fact to decompose a pair of trees we only require a vertex in T\ with dangling 
subtree Si and a vertex in T 2 with dangling subtree 5*2 such that the leaves in S\ 
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and 5*2 are the same. For example, there are two such vertex pairs in the tree pair 




(where corresponding leaves have been given the same label). Breaking the trees 
at levels 2 and 8 as 




produces the same partition {{a, I}, {b, c, h, &:}, {d, e, f,g}} of the leaves in both 
trees. Thus, to find a parse word for the original pair it suffices to find parse words 
for the subtree pairs. Proposition [2] guarantees that we can reattach the subtrees 
consistently, since every binary tree that parses w receives the same label for its 
root when the leaves are labeled with the letters of w. Let us call a pair of path 
trees indecomposable if there is no such (nontrivial) decomposition. 

The tree pair in Theorem [4] consisting of LeftCombTree(n) and RightComb- 
Tree(n) is indecomposable, as is the pair in Theorem [6] consisting of LeftComb- 
Tree(n) and RightCrookedTree(n). On the other hand, breaking the trees Left- 
Comb Tree(n) and LeftCrookedTree(n) at level 1 shows that this pair is decompos- 
able, and in this case the decomposition accounts for the non-uniqueness of the 
equivalence classes of words in Theorem [7J 

The technique of decomposing trees is not limited to path trees. For example, 
the pair 
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can also be decomposed into the two pairs 



6.2. Pairs that are not mutually crooked. We showed in Proposition 11 that 
if leaves i and i + 1 are siblings in T\ and are on consecutive levels in T 2 then this 
pair of trees is reducible. It is natural then to ask whether a tree pair in which 
leaves i and i + 1 are on consecutive levels in both trees is reducible. 
First let us consider the pair 





which has the common parse word 0001220. The three consecutive 0s label leaves 
in both trees that are arranged in a right comb structure, and shortening each comb 
by two leaves produces the pair 



which parses 01220. In other words, we obtain a parse word for the larger pair by 
simply inserting two 0s corresponding to the two added leaves. Let us see why this 
works. In the first tree, the 4-leaf right comb subtree whose first three leaves are 
labeled has internal vertex labels 1,2, and 1; in the second tree, the corresponding 
4-leaf right comb subtree has internal vertex labels 2, 1, and 2. In both cases, the 
internal vertex labels alternate between 1 and 2, so shortening the comb by two 
leaves preserves the label of the root and the labels of the bottom leaves. Hence we 
still have a valid labeling. 

The situation will be the same even if the two combs have different orientations, 
and even if the trees are not path trees but binary trees in general. We formalize 
this as follows. The relevant extending operation is triplicating leaf i — inserting 
two A structures in T' so as to obtain three left leaves in T if leaf i is a left leaf in 
T' and three right leaves in T if leaf i is a right leaf in T' , analogous to duplicating 
a leaf as discussed in Section [4] 

Theorem 18. Let T{ and T 2 be (n — 2)-leaf binary trees. Let 1 < i < n — 2, and 
let T\ and T 2 be the n-leaf trees obtained from T{ and T 2 by triplicating leaf i. If 
w = w\ ■ ■ ■ w n -2 is a parse word for T{ and T 2 , then W\ ■ ■ ■ Wi-iWiWiWiWi+i • • ■ w n ^ 2 
is a parse word for T± and T 2 . 

A pair of n-leaf trees T\ and T 2 is weakly mutually crooked if it cannot be obtained 
by triplicating some leaf i in a pair of (n — 2)-leaf trees. To prove that every pair of 
binary trees has a parse word, by the previous theorem it suffices to consider pairs 
of weakly mutually crooked trees. 

However, it appears that something stronger is true. A pair of n-leaf trees T\ 
and T 2 is mutually crooked if it cannot be obtained by duplicating some leaf i in a 
pair of (n— l)-leaf trees. That is, no pair of consecutive leaves has an uncle-nephew 
relationship in both trees. For example, the pair parsing 0110212 in Section [T] is 
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mutually crooked. Experimental evidence suggests that in fact it suffices to consider 
pairs of mutually crooked trees. 

Conjecture 19. Let T[ and T 2 be (n — l)-leaf binary trees. Let 1 < i < n — 1, 

and let T\ and T 2 be the n-leaf trees obtained from T[ and T 2 by duplicating leaf i. 
There exists a parse word w — w\ ■ ■ ■ w n of T\ and T 2 such that uii — l/Jj+i. 

For example, in the pair discussed at the beginning of this subsection, leaves 5 
and 6 are on consecutive levels in both trees, and these leaves receive the label 2. 
Note however that the parse word of Ti and T 2 is not necessarily a simple extension 
of a parse word of T[ and T 2 . 

The trees LeftCrookedTree(n) and RightCrookedTree(n) (which we addressed in 
Theorem [8]) are mutually crooked, but for n > 5 no path tree is mutually crooked 
to LeftCombTree(n), since even a completely crooked tree has a pair of consecutive 
leaves that lie in consecutive levels. Theorems [4] and [5] provide additional examples 
of pairs that fail to be mutually crooked. 

6.3. Other conjectures. To prove that every pair of n-leaf binary trees T\ and T 2 
has a parse word, it therefore suffices to consider indecomposable, weakly mutually 
crooked pairs of trees. In particular, we may assume that the leaves on level 1 in 
T\ and T 2 are different, since if they are the same then the pair is decomposable at 
level 1 into smaller pairs. 

Theorem 20. Let n > 3, and let T\ and T 2 be n-leaf path trees such that leaf 1 is 
on level 1 in T\ and leaf n is on level 1 in T 2 . Then T\ and T 2 have no parse word 
of the form Olvl. 

Proof. For n — 3 one checks that Oil is not a parse word for one of the two 3- 
leaf binary trees. Assume n > 4. Toward a contradiction, suppose that Olvl is a 
common parse word for some v. Then the root of each tree receives the label 2, and 
the internal vertex on level 1 of T\ receives 1. Consider the children of this internal 
vertex. If the right child is a leaf, then it is leaf n and so receives 1, which is not a 
valid label because its parent is already labeled 1. If the left child is a leaf, then it 
is leaf 2 and so receives 1, which is also not a valid label. □ 

A similar argument shows that if there is a parse word of the form 01u2, then 
leaf 2 of T\ is on level 2, and leaf n — 1 of T 2 is on level 2. 

The following conjecture gives several statements that seem to be true and may 
be helpful in proving Theorem [I] for path trees directly. 

Conjecture 21. Let n > A, and let T\ and T 2 be n-leaf path trees such that leaf 1 
is on level 1 in T\ and leaf n is on level 1 in T 2 . Then we have the following. 

• // Ti and T 2 have no parse word of the form OOv or vOO, then they have a 
unique parse word (up to permutation of alphabet). 

• // Ti and T 2 have no parse word of the form OOv and are mutually crooked, 
then they have a parse word of the form OluOO. 

• If Ti and T 2 have no parse word of the form OOv, then the only possibilities 
for the 2-tuple 

(level of leaf 2 in Ti, level of leaf n — 1 in T 2 ) 

are (2, 3) and (fc, 2) for some k > 2. 

Moreover, if Ti and T 2 are weakly mutually crooked, the only possibilities 
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are (2, 3) and (k, 2) for some 2 < k < 4. 

Moreover, ifT\ andT 2 are mutually crooked, the only possibilities are (2,3) 
and (fc, 2) for some 2 < k < 3. 

Finally, we give an interesting conjecture that has been explicitly verified for 
n < 12. (The statement does not hold for general binary trees.) 

Conjecture 22. Let n > 4. Every pair of n-leaf path trees parses a word of the 
form uOOv for some (possibly empty) u,v. 
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